Not-So-Linked Solution to the Linked Data Mining Challenge 2016
نویسنده
چکیده
We present a solution for the Linked Data Mining Challenge 2016, that achieved 92.5% accuracy according to the submission system. The solution uses a hand-crafted dataset, that was created by scraping various websites for reviews. We use logistic regression to learn a classification model and we publish all our results to GitHub.
منابع مشابه
Can You Judge a Music Album by its Cover?
In this work we explore the potential role of music album cover arts for the task of predicting the overall rating of music albums and we investigate if one can judge a music album by its cover alone. We present the results of our participation to the Linked Data Mining Challenge at the Know@LOD 2016 Workshop, which suggest that the the cover album alone might not be sufficient for the rating p...
متن کاملThe Effect of Transitive Closure on the Calibration of Logistic Regression for Entity Resolution
This paper describes a series of experiments in using logistic regression machine learning as a method for entity resolution. From these experiments the authors concluded that when a supervised ML algorithm is trained to classify a pair of entity references as linked or not linked pair, the evaluation of the model’s performance should take into account the transitive closure of its pairwise lin...
متن کاملThe Linked Data Mining Challenge 2014: Results and Experiences
The 2014 edition of the Linked Data Mining Challenge, conducted in conjunction with Know@LOD 2014, has been the third edition of this challenge. The underlying data came from two domains: public procurement, and researcher collaboration. Like in the previous year, when the challenge was held at the Data Mining on Linked Data workshop co-located with the European Conference on Machine Learning a...
متن کاملA Hybrid Method for Rating Prediction Using Linked Data Features and Text Reviews
This paper describes our entry for the Linked Data Mining Challenge 2016, which poses the problem of classifying music albums as ‘good’ or ‘bad’ by mining Linked Data. The original labels are assigned according to aggregated critic scores published by the Metacritic website. To this end, the challenge provides datasets that contain the DBpedia reference for music albums. Our approach benefits f...
متن کاملThe Linked Data Mining Challenge 2015
The 2015 edition of the Linked Data Mining Challenge, conducted in conjunction with Know@LOD 2015, has been the third edition of this challenge. This year’s dataset collected movie ratings, where the task was to classify well and badly rated movies. The solutions submitted reached an accuracy of almost 95%, which is a clear advancement over the baseline of 60%. However, there is still headroom ...
متن کامل